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Abstract. Train tracks with a single vertex are a generalization of interval exchange maps. 
Here, we consider non-classical interval exchanges: complete train tracks with a single vertex. 
These can be studied as a dynamical system by considering Rauzy induction in this context. 
This gives a refinement process on the parameter space similar to Kerckhoff's simplicial 
systems. We show that the refinement process gives an expansion that has a key dynamical 
property called uniform distortion. We use uniform distortion to prove normality of the 
expansion. Consequently we prove an analog of Keane's conjecture: almost every non- 
classical interval exchange is uniquely ergodic. 



1. Introduction 

Here, we are interested in the dynamical properties of non-classical interval exchanges. The 
idea behind classical interval exchange maps is to partition an interval / into d subintervals, 
permute these subintervals and then glue them back, preserving their orientation, to get 
back the original interval /. The result is a Lebesgue measure preserving map from / to 
itself which we call classical i.e.m. in short. The data that completely determines a classical 
i.e.m. is: first, the lengths of the subintervals and second, the permutation used for gluing. 
There is a way to draw these maps pictorially. 

Draw the original interval / horizontally in the plane 
and then thicken it vertically to get two copies of /, the 
top and the bottom. The bottom interval is divided into 
d subintervals with the prescribed lengths. The top inter- 
val is also divided into d subintervals but the permutation 
is incorporated into the subdivision to decide the lengths. 
Each subinterval on the bottom pairs off, by the permuta- 
tion, with a subinterval on the top with the same length. 
Join these subintervals by a band of uniform width equal 
to their common length. As an example, a classical inter- 
val exchange with 2 subintervals is shown in Figure 11.11 

Train tracks enter the picture as follows: To each band 
we can associate a central edge that runs along the length 
of the band from the bottom to the top. If we collapse 
the bands to these central edges and the thickened inter- 
val horizontally to a vertical edge and finally collapse the 
vertical edge, we get a train track with a single vertex. 
The resulting train track has the property that every out- 
going edge is incoming on the other side of the vertex. A generalization of interval exchanges 
can be defined by relaxing this last property. The picture associated will be of a thickened 
interval with bands, in which orientation reversing bands that run from bottom to bottom 
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or top to top are allowed. Here, we shall consider tracks that are recurrent. This implies 
that if there are orientation reversing bands on one side of the thickened interval, then there 
is at least one orientation reversing band on the other side. Moreover, we assume that the 
abstract train track considered, admits an embedding into a surface with non-positive Euler 
characteristic as a complete train track. A necessary condition for this is that there is at least 
one orientation reversing band on each side. We shall call the thickened interval with bands 
associated to such a train track, a non-classical interval exchange. The Lebesgue measure 
on the interval is invariant under a non-classical interval exchange; the precise meaning of 
which will be outlined in Section [2j 

For classical i.e.m.'s Keane conjectured that if the reshuffle permutation is irreducible, 
then for almost all widths of the bands, the associated classical i.e.m. is uniquely ergodic. In 
particular, the Lebesgue measure on the interval is the only invariant probability measure. 
This conjecture was first proved, independently, by Masur [B] and Veech [TU]. See also Rees 
[S] and Zorich [T2]. A key tool was Rauzy induction, alternatively called splitting: this is the 
process of inducing a classical i.e.m. on a suitable subinterval by considering the first return 
map. Veech [TU] used this to develop a continued fraction expansion for classical i.e.m.'s. 
When the classical i.e.m. has two bands, it is the same as the continued fraction expansion 
of the ratio of the widths of the bands. Veech showed that this expansion had most of the 
nice properties of the classical Gauss map. He derived unique ergodicity as a consequence 
of this. 

A similar conjecture was also made for transverse measures for measured foliations on a 
Riemann surface. Masur proved this conjecture along with the original Keane conjecture in 
[5J; his approach relied on techniques from Teichmuller theory and applied to both contexts. 
Later, Kerkchoff [5] developed a different and more elementary approach that does not rely 
on Teichmuller theory, to include both conjectures under a unified setup. In his formulation, 
he encoded Rauzy induction as a refinement process on the parameter space associated to a 
classical i.e.m. For measured foliations, Rauzy induction generalizes to an induction process 
called splitting train tracks. So the refinement process makes sense also for the parameter 
space associated to a complete train track. 

For a classical i.e.m. with d bands, the parameter space at each stage of the process can 
be identified with the standard simplex of dimension (d — 1). Rauzy induction splits this 
simplex into two halves, each a (d — l)-simplex itself. A point in the original simplex picks 
out the half in which it lies. In addition, there is a map from the new parameter space to 
the starting parameter space. It identifies the standard simplex with the half that is picked 
out. After this identification, the simplex given by the new parameter space becomes the 
stage to which the induction is applied and the process continues. Associated to each stage 
given by a finite sequence of inductions, there is a map from the current parameter space 
back into the starting parameter space. This map is defined iteratively as a composition of 
the maps associated to individual steps in the sequence. 

It turns out that almost surely, iterations give an infinite expansion consisting of nested 
simplices on the parameter space. It can be shown that there is a canonical identification 
of the infinite intersection of these nested simplices with the set of invariant probability 
measures. The refinement process can be defined purely abstractly; Kerkchoff called this 
a simplicial system and the resulting expansion, a simplicial expansion. He showed that if 
a simplicial system satisfies a combinatorial condition called the absence of isolated blocks, 
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then for almost every initial point in the starting simplex, the resulting simplicial expansion 
is normal: almost surely, every finite sequence of inductions that can occur in the expansion, 
does occur infinitely often. Finally, normality and the existence of finite sequences which 
shrink the diameter by a definite amount ensure that, almost surely the nested sequence 
of simplices, actually nests down to a point. If the reshuffle permutation of a classical 
i.e.m. is irreducible then Kerckhoff showed that its simplicial system has no isolated blocks. 
Consequently, almost surely, the expansion nests down to a point and so there is a unique 
invariant probability measure. This means one has unique ergodicity. 

For complete train tracks, the parameter space is the set of possible weights carried by 
the train track, normalized so that the sum of weights is one. So it is a convex subset of the 
ambient simplex cut out by the switch conditions of the train track. It has codimension in 
the simplex equal to the number of switch conditions. For a non-classical interval exchange 
with d bands, there is a single switch condition. So the parameter space is of codimension 1 
in the standard (d — l)-dimensional simplex. The main theorem we show here is 

Theorem 1.2 (Normality). For non-classical interval exchanges, almost surely, the expan- 
sion is normal, i.e. every finite non-transient sequence that can occur in the expansion, does 
occur infinitely often. 

As we will see in Section [5j combinatorially, there are finitely many subsets inside the stan- 
dard (d— l)-simplex which can be the parameter space for a non-classical interval exchange 
with d bands. We shall call each such subset a configuration space and denote them by W r . 
For all the subsequent discussion, we fix a combinatorial type 7r for the initial non-classical 
interval exchange and call it the starting stage. The starting stage is assumed to be irre- 
ducible and in a sink of the Rauzy diagram (see Section for the precise definitions). Let Wq 
be the configuration space at ttq. As shall be explained in Section [51 for almost every point 
in Wq, repeated Rauzy induction defines an infinite expansion. Let n' be a subsequent stage 
obtained from ttq by a finite sequence of inductions. Let W be the configuration space at 
7r'. The associated map /' relates the new widths of the bands to the starting widths in ttq, 
by giving a diffeomorphism from A onto a subset of itself, such that f'(W) = Wq PI /'(A). 
Fix a constant C > 1. The stage ir' is C-uniformly distorted if, for any pair of points y, y in 
W, the Jacobian of /' thought of as a map from W to W , satisfies 

C J(f')(y) 

Let ii be any stage obtained from tt by a finite sequence of inductions. Let W be the 
configuration space at tt and / be the associated map. The main technical theorem required 
to prove Theorem 11.21 is the uniform distortion theorem, stated below 

Theorem 1.3 (Uniform Distortion). There exists a constant C > 1, independent of the 
stage tt, such that, for almost every x G f(W), the associated expansion of x after tt, has 
some stage tt' , depending on x, such that tt' is C-uniformly distorted. Moreover, if ttq is in 
a sink of the Rauzy diagram, then tt' can be assumed to be combinatorially the same as tt q . 

Uniform distortion implies that the relative probability of a permissible finite sequence 
occurring right after a uniformly distorted stage, is roughly the same as the probability that 
an expansion begins with that sequence. Normality is a straightforward consequence of this 
phenomena. 
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This approach was originally outlined by Kerckhoff in [5]. For classical i.e.m.'s, he showed 
uniform distortion and consequently normality and unique ergodicity. However for complete 
train tracks, there are two issues in making this work. Firstly, the probability of a particular 
split is the ratio of the volume of the part of the configuration space that is inside the 
smaller simplex picked out by the split, to the volume of the entire configuration space. As 
we shall see in Section 0, this ratio can be very different from the proportion of the volumes 
of the ambient simplices. Secondly, splitting sequences of complete train tracks can have 
isolated blocks. These issues leave the proof of unique ergodicity for measured foliations in 
[5] incomplete. 

We do not know how to fix this for general complete train tracks. Nevertheless, we settle 
these issues for non-classical interval exchanges here, by proving Theorems 11.31 and 11.21 As 
a consequence we also show in Theorem 112.11 that the map on the parameter space given 
by Rauzy induction is ergodic. Finally, using the irreducibility criteria for linear involutions 
by Boissy and Lanneau, we indicate how to prove the theorems for geometrically irreducible 
linear involutions. 

1.4. Outline of the paper. In Section [21 we begin with the definition of a classical 
i.e.m. and see how its pictorial interpretation leads naturally to the definition of a non- 
classical interval exchange. In Section [31 we define Rauzy induction. This defines a directed 
graph associated to non-classical interval exchange with a fixed number of bands. For clas- 
sical i.e.m., this graph is called a Rauzy diagram. We continue to call it a Rauzy diagram 
for non-classical interval exchanges. In Section HI we describe the key differences between 
the Rauzy diagram for a non-classical interval exchange and the classical Rauzy diagram 
and their relevance to Theorems 11.21 and 11.31 In Section we show how Rauzy induction 
defines a refinement process on the parameter space. This associates infinite expansions by 
iterated induction to almost every point in the parameter space. We then formulate the main 
theorem, Theorem 15.101 which is the more precise statement of Theorem 11.31 In Section [6j 
we illustrate by an example the main ideas of Kerckhoff 's original proof for classical i.e.m. 
We also provide an example of a splitting sequence of non-classical interval exchange with 4 
bands that illustrates a key issue which makes the proof for non-classical interval exchanges 
more intricate. This issue arises from the fact that the parameter space for non-classical 
interval exchanges is a codimension one subset of the ambient simplex. Resolving the issue 
requires understanding the Jacobian of the restriction of the projective linear map to this 
subset instead of considering the full simplex. This is done in Section In Section [SJ we 
recall the main results in the measure theory of projective linear maps with non-negative 
entries from a standard simplex into itself, and use it to compute the measures of some stan- 
dard subsets that we will encounter in the later section. Section [9] gives the detailed proof of 
Theorem 15.101 In Section [10], we prove normality from Theorem 15.101 Finally in Section [TH 
we use normality to show unique ergodicity. As a consequence, we show in Section [TJl that 
the Rauzy map on the parameter space is ergodic. Finally, in Section [TJl we indicate why 
the theorems generalize to geometrically irreducible linear involutions. 
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2. Non-classical Interval Exchanges 

For a precise definition of a classical interval exchange, see Yoccoz [IT]. Here we will focus 
on representing a classical i.e.m. pictorially. 

Let S denote the set {1,2, ... ,d}. A classical i.e.m. is determined by the lengths (A a ), a G 
S of the subintervals and a permutation tx of S. A classical i.e.m. is represented pictorially 
as follows: In the plane, draw the interval / = [0, Yl a A a ) along the horizontal axis and then 
thicken it slightly in the vertical direction to get two copies, I x (5) and / x (—5). Call 
them top interval and bottom interval respectively. Subdivide the bottom interval into d 
sub-intervals with lengths Aj, A2, • • • , A^ from left to right. Subdivide the top interval into 
d sub-intervals with lengths A^-im, A 7r -i(2), • • • , A^-i^) from left to right. Now join each 
subinterval on the bottom to the corresponding subinterval on the top by a band of uniform 
width A Q . To determine the image of a point x G / under the i.e.m., pick the subinterval 
on the bottom in which x lies and flow x along the band to the top. The only ambiguity in 
the definition occurs at the endpoints of the subintervals because they can be thought of as 
belonging to two adjacent bands. This is removed by requiring the endpoint flow along the 
band that lies to the right. 

One can construct a train track from this picture. Recall that a train track is a 1- 
dimensional CW complex in which there is a common line of tangency to all the 1-dimensional 
branches that join at a 0-dimensional switch. This splits the set of branches incident on a 
switch into two subsets, one of which will be called the set of outgoing edges and the other 
the set of incoming edges. In addition, one assigns non-negative weights to the branches 
so that the sum of the weights of the outgoing branches at a switch is equal to the sum of 
the weights of the incoming branches. In our picture, each band has a central edge which 
joins the midpoint of the subinterval on the bottom to the midpoint of the corresponding 
subinterval on the top. Retract each band to its central edge and the thickened interval to a 
vertical edge. It is clear that this can be done in a way such that the edges associated to the 
bands share a vertical line of tangency on each side of this vertical edge. Finally, retract the 
central vertical edge to a point. The result is a train track with a single vertex. It has the 
property that every outgoing edge from the bottom is incoming from the top and vice versa. 
Also, assign the width of the associated band as the weight on the branch. It is immediate 
that this assignment of weights is consistent with the switch condition. 

The first step towards defining non-classical interval exchanges is to relax the constraint 
that every outgoing edge from the bottom be incoming from the top and vice versa, i.e. to 
allow bands that run from bottom to bottom or top to top. We call such bands orientation 
reversing because the flow along such a band reverses the orientation of the subinterval. 

A train track is recurrent if there is an assignment of weights satisfying the switch con- 
ditions, such that all the weights are strictly positive. Here, it implies that if there are 
orientation reversing bands on one side then there has to be at least one orientation revers- 
ing band on the other side. The interval exchange that we get from this picture shall be 
called a recurrent interval exchange. 
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We define a non- classical interval ex- 
change to be an interval exchange whose ab- 
stract train track admits an embedding into 
some oriented surface with non-positive Eu- 
ler characteristic, as a complete train track 
on the surface, i.e. the complementary re- 
gions to the embedded track are either ideal 
triangles or once punctured monogons. It Figure 2.1. A non-classical 

follows that a non-classical interval exchange interval exchange 

is recurrent; moreover it is necessary that 
there is at least one orientation band on each 

side. An example of a non-classical interval exchange is shown in Figure 12.11 the train track 
on the right is a complete train track on a sphere with 4 punctures. In the figure the 
punctures are indicated by the dots. 

It must be pointed out that once there are orientation reversing bands, there is no actual 
map from the interval I to itself. In a classical i.e.m. one can orient the bands to flow from 
the bottom to the top. This is no longer possible once there are orientation reversing bands. 
The point is that the non-classical interval exchanges carry vertical measured foliations on 
the Riemann surface that are associated to quadratic differentials , whereas the classical 
i.e.m. 's carry vertical measured foliations that are associated to abelian differentials. 

Here, we shall work with labeled non-classical interval exchanges. A labeling is a bijection 
from S to the set of bands. Non-classical interval exchanges with the same exchange combi- 
natorics but different labeling shall be regarded as different. We shall denote a non-classical 
interval exchange by T = ((A a ),7r), where 7r shall denote the exchange combinatorics and 
(A a ) specifies the widths of the bands according to the labeling. We call the positions on the 
top and bottom that are rightmost on the intervals the critical positions. 

3. Rauzy induction 

We shall now describe Rauzy induction. Since the underlying picture is that of an inter- 
val with bands, Rauzy induction for a non-classical interval exchange is exactly like Rauzy 
induction for a classical i.e.m. Iterations of Rauzy induction on a classical i.e.m. give an ex- 
pansion analogous to the continued fraction expansion. In fact, when the classical i.e.m. has 
just two bands, the expansion by Rauzy induction is the same as the continued fraction 
expansion of the ratio of the widths of the two bands. 

Suppose T is a non-classical interval exchange. Let ao and a% be the labels of the bands 
in the critical positions with a± on the top. First, suppose that A ai > A ao . Then we slice as 
shown in Figure [37T1 till we hit the original interval for the first time. The ai band remains in 
its critical position, but typically a band with a different label a' moves into the other critical 
position. Furthermore, the new width of a% is A^ — A ao . All other widths remain unchanged. 
If A ai < A Q0 instead, we slice in the opposite direction, which in Figure 13.11 would be the 
analogous operation after flipping the picture about the horizontal axis. In either case, we get 
new non-classical interval exchanges with combinatorics and widths as described above. The 
operation we just described is called Rauzy induction. The non-classical interval exchange 
we get by Rauzy induction carries an induced labeling. Since Rauzy induction is represented 
pictorially by one band being split by another, it's also called a split. This is consistent with 
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Figure 3.1. Rauzy Induction 



the notion of a split in the context of train-tracks; if the interval exchange is thought of as a 
train track then Rauzy induction is the same as splitting the train track. Iterations of Rauzy 
induction are called splitting sequences. 

For a classical i.e.m., Rauzy induction is the first return map under T to the interval 
I' = [0> * n ^ ne fi rs t instance and V = [0, ^2 a ^ a X a ) in the second. 

Let M Q/ g be the dx d-matrix with the (a, (3) entry 1 and all other entries 0. The relationship 
between the old and new width data can be expressed by 

A = EX' 

where the matrix E has the form E = I + M. In the first instance of the split, when 
X ai > X ao , the matrix M = M aiao ; in the second instance of the split, when A ao > A ai , the 
matrix M = M aoCtl . Thus, in either case the matrix E is an elementary matrix, in particular 
E G SL(d; Z). If B is any d x d matrix then in the instance when A ai > X ao , the action on 
B by right multiplication by E has the effect that the «o _ th column of B is replaced by the 
sum of the cto-th column and «i-th column of B. We phrase this as: in the split, the «i-th 
column moves ao-th column. Similar statements hold when A ao > A ai . 

One question that remains is what happens when A ai = A ao ? The split is undefined as it 
stands. As we shall see in Section 0, the expansion by splitting repeatedly is infinite almost 
surely. 

4. Rauzy diagram 

One can construct an oriented graph for non-classical interval exchanges similar to the 
Rauzy diagram for a classical i.e.m. However, there are some key differences which we now 
explain. 

Construct an oriented graph G as follows: the nodes of the graph are combinatorial types 
tt of labeled non-classical interval exchanges with d bands. Draw an arrow from tt to n', if 
7r' is a combinatorial type resulting from splitting tt. For each node tt, there are at most two 
arrows coming out of it. A splitting sequence gives us a directed path in G. 

It is possible to construct a different oriented graph G in which the nodes are combinatorial 
types of non-classical interval exchanges with d bands without labeling. The arrows are drawn 
as before. The graph G is analogous to the reduced Rauzy diagram for irreducible classical 
i.e.m. See Section 4.2 of Yoccoz [11] . There is an obvious map from G to G given by 
forgetting the labeling. 
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For irreducible classical i.e.m.'s, each connected component of the Rauzy diagram is 
strongly connected, i.e. any node can be joined to any other node by a directed path. This 
implies that the reduced Rauzy diagram is also strongly connected. 

This is not the case with non-classical interval exchanges. For example, the reduced Rauzy 
diagram for complete genus-2 interval exchanges is not strongly connected. See Dunfield and 
D.Thurston [I] and for the general case Boissy and Lanneau [T]. In Section 0, we shall see 
why this issue is not significant for the measure theory. 

In our discussion, we shall always refer to the reduced Rauzy diagram G. 



In this section, we analyze the expansion by splitting sequences on the space of widths 
of the bands. It turns out that the space of invariant probability measures embeds into the 
space of widths and the iterative refinement process defined by the splitting sequences, in 
the limit, nests down to the set of invariant probability measures. So the basic idea, due to 
Kerckhoff, is to show that for almost every non-classical interval exchange, one nests down 
to a single point. 

5.1. Description of the parameter space. For all non-classical interval exchanges T 
sharing the same combinatorics it, a choice of labeling gives a bijection between the set of 
bands and the standard basis of M. d . Let {e a } denote the standard basis under the bijection. 
We get a map from the set of such T into (M>o) d \ {0} by thinking of (A Q ) as co-ordinates 
for e a . Denote the image by W. The normalization ^2 X a = 1, restricts W to lie in the 
standard (d — l)-simplex A. The switch condition satisfied by the interval exchanges is the 
other constraint that points in W satisfy. Let S t and Sb be the labels of orientation reversing 
bands on the top and the bottom respectively. The switch condition is equivalent to 



Thus W is the intersection with A of a codimension 1 subspace of M. d . For a G St and 
(3 G Shi l e t e a/3 be the midpoint of the edge [e a , ep] of A joining the vertices e a and ep. The 
subset W is the convex hull of the points e a p and e p for p S t U Sb- 

There are finitely many pairs (St, Sb) of disjoint subsets of 5*. So there are finitely many 
convex codimension 1 subsets of A that could be W. We let X be the set of possible pairs 
(S t , Sb) and call the subsets of A the pairs define as configuration spaces. We index elements 
of I by r and whenever necessary, index the corresponding configuration space as W r . 

The full parameter space for a non-classical interval exchange with d bands is a disjoint 
union of configuration spaces, each of which is given by a combinatorial type and a choice 
of labeling. The labeling once specified initially induces forward under splitting sequences. 

5.2. Irreducibility. For the initial discussion in this sub-section, we only assume that our 
interval exchanges are recurrent and formulate a notion of irreducibility for them. Subse- 
quently, we show that if we assume that our interval exchanges are complete, i.e. by definition 
non-classical interval exchanges, then they are irreducible. 

Definition 5.3. A combinatorial type it of a recurrent interval exchange is combinatorially 
reducible if S can be written as a disjoint union S\ U S2 of nonempty subsets such that for 
all a G S\ and (3 G S2, the ends of a bands occur to the left of the ends of [3 bands. 



5. Dynamics 
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This means that for some choice of widths, the interval / can be cut into two intervals 
Ii and I 2 such that all the S\ bands are incident on I\ and all the S 2 bands are incident 
on I 2 . Thus, 1\ and I 2 define interval exchanges by themselves with #Si and #S 2 bands 
respectively. So for some choice of widths, the original interval exchange is obtained by 
concatenating two disjoint interval exchanges with fewer bands. 

When does combinatorial reducibility imply measure reducibility? In other words, if 7r is 
combinatorially reducible as S = Si U S 2 , does this decompose / into interval exchanges l\ 
and I 2 for all widths? The answer is straightforward. 

Proposition 5.4. A combinatorial type n, combinatorially reducible as S = Si U S 2 , is 

measure reducible if and only if all the orientation reversing bands in it are entirely in Si or 
entirely in S 2 . 

Proof. Let Si^ and Si }t denote the set of orientation reversing bands in Si that are incident 
on the bottom and top respectively, n is measure reducible if and only if the additional 
constraint 



gives the defining equation of W or is vacuous. If the former, then all the orientation reversing 
bands of n have to be in Si. If the latter, then all the orientation reversing bands of n have 



Definition 5.6. A combinatorial type tt of a recurrent interval exchange is said to be irre- 
ducible if it has no measure reduction. 

The next proposition tells us that a non-classical interval exchange is always irreducible. 

Proposition 5.7. Suppose n is the combinatorial type of a non-classical interval exchange. 
Then n is irreducible. 

Proof. Suppose that tc admits a measure reduction S = Si L\S 2 . By the previous proposition, 
all the orientation reversing bands in it must belong entirely say to Si. Mark the point on 
I that is the common endpoint of both the intervals I\ and I 2 . It's clear that one can first 
slice the train track vertically through this common point to break it into a union of two 
disjoint tracks and then embed it into the surface. This means that any foliation carried by 
the embedded copy of the original track is now carried by the embedded copy of the new 
track. Now one of the complementary regions for the new track is not an ideal triangle. So 
the foliations cannot be complete, which is a contradiction. □ 

5.8. Iterations of Rauzy induction. We shall restrict our initial choice to the set of non- 
classical interval exchanges with fixed combinatorics ttq and a fixed labeling. The choice 
picks out a connected component in the full parameter space. Identify the set of normalized 
non-classical interval exchanges T compatible with these choices with a configuration space 
W ro in the standard simplex A. After the identification, we shall refer to T by the point x 
in W ro that represents it. 

The first split for x = (A a ) picks out an arrow starting at 7r . We denote the endpoint of 
the arrow by 7r X; i and denote the elementary matrix associated to this split by £x,i- The 
new widths (A^) of the bands are related to the previous by 



(5.5) 




/9eSi, t 



to be in S 2 . 



□ 
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Given a matrix A with non-negative entries, we define the projectivization J A as a map 
from A to itself by 



^(y) = 




where if y = (y±, y^ • ■ ■ , yd) in co-ordinates then |y| = \y ri \. This shall be the norm used 
throughout. The norm is additive on M> , i.e. for y, y' in M> , |y + y'| = |y| + |y'|. 

In this instance, J-E x ,i has the property that the inverse image of W ro under JE x i is some 
configuration space W n E F. This implies that for x^ = A < - 1 - > /|A < - 1 ^| G W ri 

J£ X)1 (x«)=x 

The same operation as above can be now applied to the point x^ 1 ) . Iterating in this manner, 
one gets a directed path in G with vertices 7r Xjn , and for each split 7r xn -i — > 7r X) n the 
associated elementary matrix E Kjn . In addition, we get a sequence of configuration spaces 
W Tn and points x^ n ^ e W rn , constructed iteratively, such that they satisfy x*™ -1 ) = JE Xjn x.^ n \ 
Let 

Qx,n — E x ^E Xt 2 ■ ■ ■ E x>n 

The actual (or un-normalized) widths of the bands at each stage 7r x n satisfy x = Q Xtn \^ n ' 
and hence are in the projective class of x^. The sets JQ x ,n(W rn ) form a nested sequence in 
W ro , all containing x. We thus have an expansion for x, in which each stage of the expansion 
is encoded by a determinant 1 projective linear map with non-negative entries. 

Now we show that almost surely, one gets an infinite expansion. Recall that the set for 
which a split is not defined is given by A ao = A ai , where «o and a.\ are the bands in the 
critical positions. This is always a codimension 1 condition in A; hence for classical i.e.m. it 
is measure zero. The set of classical i.e.m. for which the splitting sequence stops in a finite 
number of steps is a countable union of such measure zero sets and hence measure zero. 

For non-classical interval exchanges, A Q0 = X ai is a codimension 1 condition in the con- 
figuration space, except when the «o and ot\ are the only orientation reversing bands in the 
interval exchange. In this case, we simply amalgamate together the ends of «o and a\ in 
the critical positions into a single band and cut that part of the band out of the interval /. 
This makes it measure equivalent to a classical i.e.m. with id — 1) bands. So almost surely 
iterated splitting gives an infinite expansion. 

An infinite expansion traces out an infinite directed path in the reduced diagram G. As 
mentioned before, connected components of G need not be strongly connected. However each 
connected component has at least one strongly connected piece i.e. any vertex in this piece 
splits down to any other vertex in the piece and moreover no vertex in the piece can split 
to something outside. Dunfield and D.Thurston [1] call this a sink. In theory, a connected 
component of G can have more than one sink. More generally, for linear involutions, Boissy 
and Lanneau PQ give a precise criteria under which a linear involution splits down to end up 
in some sink. The vertices that are not in any sink are called transient vertices. They are 
transient in the sense that if ttq is not in any sink, then a generic expansion has some initial 
finite portion consisting of transient vertices after which it ends up in some sink. Once it is 
in a sink, all the vertices from then on are non-transient. In particular, depending on the 
initial sequence of transients, different expansions may end up in different sinks. For our 
purposes, these subtleties do not matter because the issue of uniform distortion in almost 
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every expansion makes no reference to which sink we are in. So to simplify matters, we shall 
assume that 7To belongs to some sink of G to avoid transient vertices completely. 

A finite directed path 7r — > ~K\ — > . . . — > 7r n in G shall be called a stage in the expansion. 
There are elementary matrices E { associated to each split 7Tj_i — > 7Tj. Let 

= -^1-^2 • • • -En 

The image of JC} n in is the set of all x e VV ro whose expansion begins with this finite 
sequence i.e. 7r Xii = 7Tj for all i < n. 

Let W rn = (^Qn) -1 Wr - Subsequently, we will be interested in estimating the Lebesgue 
measure of subsets of JQ n (W rn ). Here, by Lebesgue measure we mean the probability 
measure on W ro given by the (d — 2)-volume form induced on it as a sub-manifold of A, 
normalized so that the total volume of W ro is 1. 

For example, to get an estimate of £(JQ n (W rn )), we first push-forward, by JQ n , the volume 
form on W Tn . Since there are finitely many configurations spaces, the volumes of any two 
configuration spaces W r and W s differ up to some factor that depends only on d. So now 
compare the actual measure on W ro to the push-forward. The Radon-Nikodym derivative of 
the actual measure with respect to the push-forward is just the Jacobian of JQ n , restricted 
as a map from W Tn to W ro . So integrating the Jacobian over W Tn gives us £{JQ n {W rn )) up 
to the factor that relates the volumes of the two configuration spaces. This shows that to 
give quantitative estimates, one needs to understand the Jacobian of JQ n restricted as a 
map from W Tn to W ro . We denote this Jacobian by J(JQ n ). 

Suppose a stage 7r„ is combinatorially the same as ttq and suppose T is a finite splitting 
sequence starting from ttq. If the Jacobian J{JQ n ) is roughly the same at all points, then 
the relative probability that Y follows n n is also roughly the same as the probability that an 
expansion starts with T. We make the notion of the Jacobian being roughly the same at all 
points, precise below. 

Definition 5.9. Suppose tt — > 7Ti — > . . . n n is a finite directed path in the reduced diagram 
G and Q n the associated matrix. For C > 1, we say that the stage ir n is C-uniformly 
distorted if for all y, y' G W r 

i JVQnHy) c 
c ~ J{JQ n ){y') ~ 

At this point, we adopt some conventions: At any stage, points in W ro or more generally 
in its ambient simplex shall be denoted by x's, points in W Tn or more generally in its ambient 
simplex by y's, points in Q n W rn or more generally in the image under Q n of the ambient 
simplex by w's. In all cases, we use suitable subscripts whenever necessary. From the 
previous discussion y = yS n \ where x( n ) is the ra-th point constructed iteratively in the 
expansion of x. The main technical theorem is 

Theorem 5.10. Suppose n — > 7Ti — > . . . — > ir n is a stage in the expansion. There exists a 
constant C > 1, independent of the stage, such that for almost every x e JQ n (W rn ), there 
is some m> n, depending on x ; such that the stage 7r x m is C-uniformly distorted. 

6. KERCKHOFF'S APPROACH: SOME EXAMPLES 

We fix some notation. The initial labeling at tc induces a labeling at any later stage. So 
it makes sense to talk of the band with label a at any stage. Let Q be the matrix associated 
to a later stage. Then the column of Q corresponding to a shall be denoted by Q(a). 



n 



Before we present examples, we note that for classical i.e.m. Equation (17.11) implies that 
C-uniform distortion is equivalent to the C 1//d -distribution of the columns of the associated 
matrix i.e. the ratio of the norms of any two columns of the matrix must be in the interval 
{l/C x l\C 1 ' d ). 

As the first example consider a classical i.e.m. with two bands. There is just one combi- 
natorial type of such an interval exchange and hence the Rauzy diagram has just a single 
vertex. At the starting stage, we shall label the band in the critical position on the bottom 
as 1 and the other band as 2. At every stage, we normalize so that the sum of the widths 
of the bands is one. If at any stage, band 1 splits band 2, then we denote the split by the 
symbol L and if it happens the other way round, then we denote it by R. The matrices 
corresponding to L and R which we also denote by the same letters are: 



1 1 
1 



R 



1 
1 1 



Suppose Q is the matrix at some stage in the expansion and suppose that |Q(2)| > |Q(1)|. 
Following this stage, as long as band 2 keeps splitting band 1, the first column keeps moving 
the second column. Hence the new matrix has the property 

Q'(2) = Q(2) + kQ(l), Q'(1)=Q(1) 

where k is the number of times band 2 has split band 1. So the norm of the second column 
keeps increasing while the first column stays the same, making the inequality |Q(2)| > |Q(1)| 
more and more pronounced. But then, as soon as band 1 splits band 2, the second column 
moves the first. So using additivity of the norm, we get the inequality 

2 |Q'(2)| 

Thus the columns become 2-distributed. To summarize, as long as a sequence of L's occur, 
the columns get farther and farther from being nicely distributed. The resulting stage is 
farther and farther from being uniformly distorted. But as soon as a R occurs after that, 
the columns get 2-distributed, resulting in a stage that is uniformly distorted. 

Now using the measure theory of projective linear maps from a 1-simplex to itself, specif- 
ically Equation (18.91) . it can be shown that the probability of the second column increasing 
in norm by a factor of K due to a sequence of L's, is bounded away from 1 by a quantity 
that depends only on K and is independent of the stage. This means that with a definite 
probability, some R must occur, giving us 2-distribution and an instance of the theorem. 

To handle classical i.e.m. 's with d bands, Kerckhoff [5] first proves a similar proposition 
about increase in norm: At any stage in the expansion, the probability that a band is never 
split, before the norm of its column increases by a large enough factor K, is bounded away 
from 1. The bound depends only on K and d and is independent of the stage. In fact, the 
bound monotonically goes to zero as K goes to infinity. As in the example above, the proof 
of this proposition uses Equation (18. 9p . 

After this, Kerckhoff proves the following inductive step: Suppose our stage has a collection 
of C"-distributed columns that also includes the column with the largest norm. Then there 
is a definite probability that one gets a larger collection of C"-distributed columns that 
includes the largest column at that instant, C" depending only on C and d and independent 
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of the stage. Iterating the inductive step shows that with a definite probability, one must 
get C-distributed. 

The basic idea behind the inductive step is: Before the norms of the columns in the 
collection increase by a factor of K, suppose one of the following happens: either an outside 
column becomes the column with the largest norm or a column in the collection moves a 
column from outside. At this point, if we add the outside band to our collection, then 
similar to the example above, the ratio of the norms of any two columns in it is within 
suitable bounds, even if initially, the ratio is way out. So it remains to show that with a 
definite probability, one of the two events happens. 

The key idea is that this happens provided there are no isolated blocks. An isolated block 
is a splitting sequence in which there is a collection of bands that satisfy the properties: First, 
every band in the collection is moved, at least once, by some other band in the collection 
and second, every band in the collection moves some other band in the collection but never 
moves a band outside the collection. Kerckhoff shows that for irreducible classical i.e.m., 
isolated blocks are absent. 

The proposition about increase in norm implies that with a definite probability, every 
band in the collection has to move some other band before its norm increases by K. But it 
could so happen that it moves a band in the collection itself. If this repeats enough number 
of times, then Kerckhoff shows that there is a sub-collection that forms an isolated block. 
The number of times it needs to repeat is independent of the stage. Finally, since isolated 
blocks are absent, there is a definite probability that a band in the collection has to move a 
band outside, which is exactly the kind of split we want to finish the proof of the inductive 
step. 

With non-classical interval exchanges, there are two issues: First, the probabilities of splits 
are different from the ratios of the measures of the ambient simplices. This phenomena makes 
the precise statement of the proposition about increase in norm incorrect in this context. 
Second, isolated blocks are possible. To illustrate the first issue, consider the splitting 
sequence in Figure 16 . 11 




Figure 6.1. Splitting Sequence 
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The matrix Q associated to the stage is 



10 

n + 1 n+2 

2 n n + 1 

1 



Denote the simplex with the columns of Q as vertices by A(Q). Let the image under Q, of 
the configuration space for the stage be W(Q). 

As in section [51 let be the midpoint of the edge [e*, e^-j. As shown in Figure |6T2| in 
the standard simplex, the initial configuration space W is the quadrilateral with vertices 
ei3, ei4, e 24 and e 23 - 



e 3 




ei ei4 e 4 

Figure 6.2. in A 



Let Q(ij) denote the midpoint of the edge [Q(i),Q{j)) of A(Q). As shown in Figure [673 
W{Q) is the quadrilateral in A(Q) with vertices Q(12), Q(13), Q(34) and Q(24). 



Q(2) 




Q(l) Q(13) Q(3) 

Figure 6.3. in A(Q) 
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The columns representing the vertices of W(Q) are 



Q(12) = 


1/2 
(n + l)/2 
(n + 2)/2 



,Q(13) = 


1/2 
(n + 2)/2 
(n + 3)/2 



Q(34) = 




(n + 2)/2 
(n + l)/2 
1/2 


,Q(24) = 




(n + l)/2 
n/2 
1/2 



Projectivize to find the images of these vertices in the configuration space W of the starting 
stage 



JQ(12) = 



JQ{U) 



l/2(n + 2) 
(n + l)/2(n + 2) 
1/2 




1/2 

(n + l)/2(n + 2) 
l/2(n + 2) 



,JQ(13) = 



,JQ(24) 



l/2(n + 3) 
(n + 2)/2(n + 3) 
1/2 




1/2 
n/2(n + l) 
l/2(n + l) 



JQ(12) JQ(13) 




Figure 6.4. Projection of W(Q) 
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In terms of the vertices of the quadrilateral W, we get the linear combinations 



JQ(12) 
JQ(13) 
JO (34) 
JO (24) 



1 



n + 2 
1 

n + 3 
n + 1 
n + 2 

n 
n + 1 



ei3 + 
ei3 + 
e 2 3 + 
e 2 3 + 



n+1 
n + 2 
n + 2 
n + 3 
1 

n + 2 
1 

n+1 



e 2 3 



e23 



e 2 4 



e 2 4 



By symmetry, the quadrilateral W is a square with side length l/y/2. W(Q) projects 
down as shown in the Figure 16.41 From this we calculate the area of the quadrilateral 
JQ(12) JQ(24) JQ(34) JQ(13) to be l/2(n + l)(n + 2)(n + 3). 

Now imagine that in the next split band 1 splits band 2. As shown in Figure I6.5[ the 
split is represented by the smaller simplex with vertices Q(2), 0(3), 0(4) and Q(12). The 
part of W(Q) that lies inside this smaller simplex is the part of W(Q) that lies above the 
shaded plane in Figure [6751 This is the triangle (5(12)Q(24)Q(34). Projecting the triangle 
down to W gives the triangle JQ{12) JQ{2A) JQ{34) in Figure I6T41 The area of this triangle 
isl/4(n + l)(n + 2) 2 . 




0(1) 
Figure 6.5. 



0(13) 0(3) 
Band 1 splits Band 2 



Hence the probability that band 1 splits band 2 is the ratio of the area of triangle 
JQ(12) JQ(24) JQ(34) to the area of quadrilateral JQ(12) JQ(24) JQ(34) JQ (13) viz. (n + 
3)/2(n + 2), which is close to 1/2 for large n. On the other hand, when band 1 splits band 
2, 0(2) moves 0(1)- As a result, |0(1)| increases by a factor of 2(n + 2)/3. In particular, 
by choosing a stage for n large enough, one can arrange that the probability of the split is 
close to 1/2 even though the norm increase of Q(l) is by a factor as large as we want. It is 
interesting to compare this to the ratio £( J(QE)(A))/£( JQ(A)) where E is the elementary 
matrix for band 1 splitting band 2. By Lemma 18.21 the ratio is equal to the reciprocal of 
the factor by which |0(1)I increases, i.e. 3/2{n + 2). This is completely different from the 
probability of the split. 

The example given above shows that for non-classical interval exchanges, the statement of 
the proposition about increase in norm cannot be true as it stands. However, a closer look 
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reveals that for the proof of the inductive step to work, the proposition about increase in 
norm needs to hold for the largest columns around. In Proposition 19. 5[ we show that this is 
indeed true for non-classical interval exchanges. 

The second issue is that for non-classical interval exchanges, isolated blocks arise when- 
ever there is a combinatorial reduction. However, irreducibility of a non-classical interval 
exchange implies that one splits out of these blocks with a definite probability. See Step 1 
of Proposition 19.211 

From the next section on, we begin the technical details starting with the analysis of the 
Jacobian of the restriction of the projective linear map to the configuration spaces. 

7. Jacobian of the restriction 

Let ttq — > 7i"i — > • ■ • — > 7r n be a stage. For the rest of the paper, we simplify the notation 
introduced in Section [5] by dropping the subscripts. Henceforth we will denote n n by tt, the 
configuration spaces W Tn and W ro at Tr n and 7r by W and Wo and the associated matrix Q n 
by Q. In this section, we analyze the Jacobian of JQ restricted as a map from W to Wq. 
The expression for the Jacobian of full map JQ from A to itself is known j2]. At a point 
in the configuration space W, we write down a matrix for the derivative of the full map JQ 
with respect to suitable decompositions as direct sums, of the tangent spaces to A at the 
point itself and its image under JQ. Then using the expression for the full Jacobian and the 
particular form of this matrix, we get an expression for the Jacobian of JQ as a map from 

W tO Wq. 

We fix some terminology. Given an affine subspace L of IR d and some y G M d which need 
not be in L, we let T y L be the subspace of the tangent space at y G R d parallel to L. We 
will denote a vector in R d and also in the tangent space at any point y, by the same letter 
u, whenever the context is clear. Associated to a co-dimension 1 subspace L transverse to a 
vector u, there is a projection map <p u : T y M. d — ► T y L by projecting along lines parallel to u 
till one hits L. A dilation of M. d by t > shall be denoted by p t . The derivative of a linear 
map A acting on tangent spaces is denoted by A itself. 

Let A(Q) be the simplex with vertices the columns of Q. The map JQ : A — > A is a 
diffeomorphism onto its image. This means that, for any point y G A, the vector w = Qy 
is transverse to both A(Q) and A. Hence at the level of tangent spaces w is transverse 
to T W A(Q) and T W A. So the projection map W : T w R d — ► T W A by projecting along lines 
parallel to w is an isomorphism from T W A(Q) to T W A. The full derivative D JQ(y) : T y A — > 
TjQ( y )A is the composition 

T y A % T W A(Q) ^ T W A T JQiy) A 

It is known that (see Bufetov [2]) the Jacobian J/±(JQ) of the above composition is given 
by 

MJQW = = 

Fix unit vectors m and m normal to Wq and W respectively such that the tangent bundle 
TA has the orthogonal decompositions 

TA = (lm ) © TW , TA = (Mm) © TW 

where TWq and TW are the tangent bundles of the configuration spaces. 
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Since JQ restricted to W maps it into Wq and is a diffeomorphism onto its image, at any 
point y G W the full derivative DJQ restricts to an isomorphism from T y W to Tjq^Wq. 
So with respect to the orthogonal decompositions fixed above, the matrix at y for DJQ has 
the form: 



/ 

~DJQ\TyW^T JQ(y) W 

\ ... 



* \ 

* 
* 

c(y) J 



From the matrix form we get 

(7.2) Ja(JQ)(y) = c(y)J(JQ)(y) 

To compute c(y), we use the fact that 

DJQ(y) = P| w |-i ° 0w o Q 

In the composition, we project the vector Qm G T W A(Q) by lines parallel to w to get 
<p w (Qm) G T W A. Next we write </> w (Qm) as the unique linear combination 

w (Qm) = a(y)m + p 

in T W A, where p G T^Wq. Last we apply the dilation piwi- 1 to get c(y) = a(y)/|w|. Hence 
from equation (17. 2p we get 



(7.3) 



J(JQ)(y) 



l 



a(y) w 



aCyJigyl"" 1 



m 



T„W 



T A 




Figure 7.4. Schematic picture 



Lemma 7.5. v4s y varies over W , the quantity a(y) remains constant. 
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Proof. To prove the lemma, we need to understand the projection W : T W A(Q) — > T W A. We 
refer to Figure EH which is a schematic picture of T w M d . In the figure, the projection W by 
lines parallel to w is represented by the horizontal dashed arrows. Let L be the co-dimension 
1 subspace in T w M d spanned by w and T^Wq. If A denotes the subspace of M d spanned by 
W i.e. the subspace of W d satisfying the equation that defines W , then L = T W A . So L 
does not depend on w. 

Since W gives an isomorphism from T^WiQ) to T^Wq, T w W(Q) is in L. In the figure, 
we represent L by the horizontal (x, -plane. The vector m is orthogonal to T w iy and 
transverse to w, so in fact transverse to L. 

For the remainder of the proof, we shall denote the (d — l)-dimensional subspace of lR d 
that is parallel to A by A itself. The essential point from the discussion above is: thinking of 
each tangent space T w IR d as W 1 itself, there are two (d — l)-dimensional transverse subspaces 
L and A with a vector m G A not in L. Moreover, as y varies over W which is equivalent to 
saying as w varies over W(Q), the vector w is constrained to lie in L. Any vector v G T w R d 
can be uniquely written as 

v = amo + q 

where q G L. Applying the projection W : T w R d — > T W A 

w (v) = a0 w (m o ) + w (q) = am + w (q) 

The projection W has the form w (v) = v — few. This implies that the vector w (q) belongs 
to L. So the component of w (v) along mo remains amo, even as w varies. Choosing v = Qm 
we are done. □ 

By the above lemma, we can drop the dependence of a(y) on y in the expression for 
the Jacobian and write it just as a. However the number a does depend on the stage of 
the expansion. Observe that, up to the number a, the expression for the Jacobian of the 
restriction looks similar to the Jacobian of a determinant 1 projective linear map with non- 
negative entries in dimension (d — 1). We shall make this observation precise and use it to 
compute measures in Section [91 

Next, we give the definition for a set of C-distributed vectors. 

Definition 7.6. Let C > 1. A set of vectors {ui, u 2 , . . . , u^} are said to be C-distributed if 

C |uj| 

for all i,j such that 1 < i, j < k. 

We conclude from the previous lemma that a stage tc is C-uniformly distorted if and 
only if the vertices of W(Q) are C^i-distributed. Moreover if the vertices of A(Q) are 
C-distributed, then the vertices of W{Q) are also C-distributed. So to show that for almost 
every x, there exists a stage 7r Xin that is C-uniformly distorted, it is enough to show that for 
almost every x, there exists a stage 7r x n such that the columns of C/ X ,n are C-distributed. 
The proof of this roughly follows Kerkchoff's original proof in the case of classical i.e.m.'s 
while weakening various hypothesis. 

In the next section, we recall the measure theory for determinant 1 projective linear maps 
with non-negative entries, from the standard simplex into itself. Additionally, we evaluate 
measures of certain subsets of the standard simplex, which shall be useful later. 
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8. Measure Theory 



8.1. Measure Theory of Projective linear maps. Let Q : ~R d — ► M. d be a linear map 
with non-negative entries and determinant 1. Projectivize Q to get the map JQ : A — ► A. 
We recall 4.4 Lemma 1 from Yoccoz |llj : 



Lemma 8.2. 



*W(A)) 
*(A) 



1 



n 

a=l 



Remark 8.3. The formula for £(JQ(A)) can be derived by integrating the expression 17. II for 
Ja(JQ) on A. 

The next corollary does not need the assumption that the matrix Q has determinant 1. This 
will be relevant in Step 5 of Proposition 19.51 

Corollary 8.4. Let E be an elementary matrix with the off-diagonal (/3, a) entry equal to 
R > and let Q be a linear map with non-negative entries. Then 

e(J(QE)(A)) \Q(a)\ 
£(JQ(A)) " \Q(a)\ + R\Q(P)\ 

Proof. As both sides of the claimed equality are unchanged if we replace Q by tQ, we can 
reduce to the case det(Q) = 1. By applying Lemma I8~2l we get 



£{J{QE){A)) 
£(JQ(A)) 



n \Qh)\ 

7=1 

n ios(7)i 

7=1 



\Q(a) 



IgCgOj 

\QE(a)\ \Q(a)\ + R\Q((3)\ 



□ 



8.5. Evaluating measures. We apply Corollary 18. 41 to evaluate measures of certain subsets 
of A that we shall encounter in Section 9. 

Wedges. For a pair of distinct labels a, (3 E S and a non-negative constant R, let A^(a, 0) 
be the set of points in A whose convex combination x = ^ A 7 e 7 satisfies 



(8.6) 



^>R 

An 



Figure IBTTl shows a schematic picture of Ar(o, (3). The shaded plane 
E R (a,j3) represents the set of points for which \p/\ a = R. The set 
A R (a, (3) is the region in Figure IH7H that lies below the shaded plane. 
Thus it looks like a wedge in A. 

Let Q : R d -> R d be a linear map with non-negative entries. Write 
R in the form 



for some K > 0. Then 



R K '\Qm 

JQ(A R (a,(3)) = J(QE)(A) 




e P Figure 

8.7. A R {a,/3) 



20 



where E is the elementary matrix with the (/3, a) entry equal to R. Applying Corollary 18. 4[ 

we get 

(R q) £(JQ(A R (a,(3))) 1 

1 ' ' £(JQ(A)) 1 + K 

Half-space intersections. Having established Equation (18.91) for the relative measure of 
wedges, we move on to more general subsets of A which we call thin half-space intersections. 
The wedges A R (a,/3) are an instance of these. We encounter thin half-space intersections in 
the proof of Proposition 19.51 

A half-space H in R d that intersects A and does not contain the vertex e a shall be called 
a half-space opposite a. Let Sh denote the set of those 7 7^ a for which the intersection 
dH fl [e a , e 7 ] is non-empty, where [e a , e 7 ] denotes the edge of A joining e a with e 7 . For 
R > 0, a half-space H opposite a is said to be -R -thin if for every 7 e S#, the intersection 
point a a7 = <9if fl [e a , e 7 ] has a convex combination that satisfies 

Y~ > Ro 

We have the following inclusion 
Lemma 8.10. For R = Ro/d, we have 

ffnAc [J A fl (a,7) 

Proof. Let F be the sub-simplex of A whose vertices are indexed by Sh U{a}, and let F(Rq) 
denote the set of points in F whose convex combination in F satisfies A Q < 1/(1 + -Ro)- Let 
A(i?o) be the convex hull of F(Rq) with the remaining vertices of A. 

Claim 8.11. 

HH A C A(Ro) 

Proof. Since both H D A and A(Rq) are convex sets, it is enough to show that every vertex 
of H H A is in A(Rq). By the virtue of being i?o-thin, the vertices a ai are in A(Rq). All the 
other vertices of if fl A lie in F(a), the face of A opposite e a , which itself lies in A(R ). □ 

The lemma then follows from the claim 

Claim 8.12. 

A(i? )c |J A*(a, 7 ) 

Proof. Notice that the face F{a) of A is in the union above. The vertices of A(R ) that 
are not in F lie in F a . So it is enough to show that F(R ) lies in the union. The convex 
combination of a point in F(Rq) satisfies A Q < 1/(1 + Ro). This implies that for at least one 
7 e S H , 



v 7 

It follows that 



A 7 -Ro 
A„ d 



and so the point belongs to A^(a, 7). □ 
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□ 



In Proposition I9.5[ we need an upper bound on the measure of half-space intersections. This 
upper bound is derived there by using the inclusion of Lemma 18.101 and then bounding the 
measures of the individual wedges in the union. 

We now have all the preliminaries to carry out the proof of Theorem 15.101 

9. Proof of Theorem 15.101 

Let it be a stage in the expansion. Let W be the configuration space at ix and let Q be 
the matrix associated to ir. 

Proposition 9.1. For almost every x 6 JQ(W), every a 6 S is split infinitely often and 
splits other bands infinitely often. 

Proof. The first claim is that 

Claim 9.2. Suppose in the expansion for some x ; a band a G S occurs in the critical 
positions infinitely often. Then a is split infinitely often and splits other bands infinitely 
often. 

Proof. Suppose a is split finitely many times. Then, there exists a future stage 7r x m after 
which a never gets split. So the actual width of a remains unchanged from 7r xm onwards. 
On the other hand, after 7r x m , the band a splits some other band 7 infinitely often; this is 
impossible if the actual width of a is to remain fixed. 

Alternatively, suppose a splits other bands finitely many times. A band leaves a critical 
position only if it splits some other band. Since a gets split infinitely often, there exists 
a future stage 7r Xjm after which a remains fixed in one of the critical positions and is the 
only band split thereafter. But then from 7r Xjm onwards, the actual widths of the rest of the 
bands remain unchanged because they never get split. This makes it impossible to split a 
ad infinitum. □ 

Thus, to prove Proposition I9.1[ it is enough to show that almost surely, every band a 
occurs in the critical positions infinitely often. Let Z be the set of those x in JQ(W) for 
which there exists a band a that occurs in the critical positions finitely often in the expansion 
of x. Fix such a point x and let Si be the subset of those bands that occur in the critical 
positions finitely many times in the expansion of x. Then there is a future stage 7r Xi „ such 
that in the expansion of x after 7r x n 

(1) The bands in S\ never occur in the critical positions. 

(2) Every band in 5*2 = S \ S% occur in the critical positions infinitely often. 
Moreover we assume that 7r x n is the first instance in the expansion of x in which the above 
properties hold. 

Claim 9.3. S = Si U S2 is a combinatorial reduction of 7r Xin and all future stages. 

Proof. Without loss of generality, suppose some (3 E £2 occurs to the left of some a E S\ 
on the bottom. Since a never enters a critical position, whenever f3 is split, the split has to 
begin at the other end of (3. This increases the number of bands k(a) to the left of a on the 
bottom by 1. Moreover since (3 is split infinitely often, the number k(a) has to become large 
enough to force a into the critical position on the bottom which is a contradiction. This 
proves the claim. □ 
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Let H be the set of finite splitting sequences j starting from ir such that final stage of j 
is combinatorially reducible. The set H is countable because it is a subset of the countable 
set of finite splitting sequences. 

By the above claim, we can define a map ip : Z — > H . Let Z 3 = If the set Z 3 is 

non-empty, then the expansion after 7r of each point in Z 3 begins with the sequence j. Since 
H is a. countable set, £(Z) = ^2 3e jj^(Z 3 ). So to prove Proposition 19.11 it is enough to show 
that £(Zj) = for all j in H. 

Claim 9.4. For every j in H , t{Z 3 ) = 0. 

Proof. Let S = S\ U S2 be the combinatorial reduction for the final stage tt' of j. Let (X' a ) 
denote the normalized widths at stage it'. We will show that a point in Z 3 must satisfy 
constraint (15.51) in the widths (X' a ). The set given by this constraint has measure zero, which 
then proves the claim. 

For a point x that does not satisfy the constraint, set 

D = E x '«- E A ^ 

and without loss of generality, assume that D > 0. We claim that for a point with D > 0, 
there is a stage in the expansion after tt' in which twice the sum of the actual widths of 
the bands in S2 is equal to D. When this happens, a band in S\ is forced into the critical 
position on the bottom. This would show that such a point cannot be in Z 3 . 

Let X™ denote the limit of the actual widths of a in the expansion of x. To show that 
there is a stage in the expansion of x after tt' in which the sum of the actual widths of the 
bands in S2 is equal to D, we show that A^ 3 = for all a G S 2 . We prove this by showing 
that for any e > 0, there exists a future stage at which the actual widths of all bands in S2 
are at most e. 

For e > 0, there exists a stage tt XiN after tt' such that the actual widths at ir X)N satisfy 

A~ < \i N) <X™ + e for all a in S2. Then for every subsequent split after 7t Xj at, the actual 
width of the band split can be reduced by at most e. This means that the actual width of a 
band doing the splitting is at most e. For the bands in S2, there is a future stage 7t Xi tv' such 
that every band in 5*2 has split other bands at least once after 7t X) tv but before ir X N'. By the 
previous observation, the actual widths at tt^n 1 of all bands in S2 are at most e and we are 
done. □ 

This concludes the proof of Proposition 19.11 □ 

For a G S and x G JQ(W), let n(x, a) be the index of the first stage after n such that, in 
the split vr„( x a ) — > 7r n ( X Q ) +1 the band split is a; if there is no such integer, set n(x, a) = 00. 
From Proposition 19.11 we see that n(x, a) is finite almost surely. 

For M > 1 let X M>a = {x G JQ(W) : |Q„( X)Q )(a)| > M|Q(a)|}. The set X M , a is thus the 
set of those x in whose expansion the a-column increases in norm by a factor greater than 
M before a is split. 

Proposition 9.5. Let a be a band and C > 1 be a constant such that 

\Q(a)\ > -max|Q( 7 )| 

L- -y&S 
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Then there exists a constant M > 1, depending only on C and d, such that 

In fact, for all M' > M , the proportion of X M i^ a in JQ(W) has an upper bound that depends 
only on M'; moreover the bound — > as M' — > oo 

Proof. We shall prove Proposition 19.51 in a number of steps: 

Step 1. As a first step, we shall include the set Xm,o in a finite union of sets whose measures 
are easier to estimate. 

Let Y M)Ce = (JQ)~ 1 (X M)a ). For a (3 G S and a positive constant R, recall from Section [S] 
that A R (a,/3) is the set of points in A whose convex combination satisfies 

X a 

Write R as 



R = K- 



IW)I 

for some K > and let Y R {a,(3) = A R (a, (3) n W and let X R {a,(3) = JQ(Y R (a, (3)). We 
shall show that 

Lemma 9.6. For K = (M — l)/(d— 1) we /iawe the inclusion 

Y M , a Q \jY R {a,P) 

or equivalently 

X M ,a C |J X fl (a,/3) 

Proof. Since a does not get split till 7r n ( X Q ,) , the column Qn(x,a) (°0 nas the form 

Qn(x,o)(a) = Q{ol) + y^ j c p Q{j3) 

for non- negative integers Cp. 

Claim 9.7. Let y & W be the pre-image of x under JQ. The convex combination y = 
XI A 7 e 7 mfisi satisfy 

A/3 . 

/or a// (3. 

Proof. To simplify notation, let m = n(x, a). Let T denote the matrix associated to the 
splitting sequence tc — > ■ ■ ■ — > 7r m . Starting with the normalized widths (A 7 ), 7 G S at n, let 

(A^ ) denote the actual widths at n m resulting from the splitting sequence. Since a is never 
split till 7r m , the width of a remains unchanged i.e. Xa = X a . Additionally, the entries 
of cn-th column of T are given by Tp a = cp for (3 7^ a and T aa = 1. From Section [51 the 
relationship between the old and the new widths is 
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Since all terms on the right hand side above are non-negative, we get the inequality 

finishing the proof of the claim. □ 

Claim 9.8. // |Q n (x,a)(«)| > M\Q(a)\ for some constant M > 1, then for at least one (3 

M-l \Q(a)\ 



c /3 



> 



d-1 \Q(P)\ 

Proof. By the additive property of the norm on (R> ) d 

\Q n{x ,a)(a)\ = \Q(a)\+J2cp\Q(P)\ 

If |Qn(x,a)(a)| >M|Q(a)|, then 

X>IW)l>(M-l)|Q(a)| 



which implies the claim. 



□ 



By Claims |9~T1 and |9T8| for each y £ Yj^ iCe , there is some (5 such that the convex combination 
for y satisfies 

A/3 \Q(a)\ 
Aq " \Q(J3)\ 

for K = (M — l)/(d — 1). This proves the lemma. □ 

To show that there is a choice of M large enough such that £(Xm,o) is bounded above by 
(l/2d)£(JQ(W)), it is enough to show that there is a M large enough such that for every f3, 
£(JQ(Y R (a, /?))) is bounded above by (l/2d 2 )£( JQ{W)). So for the remainder of this proof, 
we focus on one such Yr(cx.,0). 



Step 2. Figure [9T91 shows for some (3, a schematic picture of the sets A R (a,f3) and Y R (a,f3) 
inside A. The simplex is drawn such that the vertex e a is on the top and the opposite face 
F(a) is in the horizontal plane forming the floor of the simplex. 



e R (a,/3) 




e R (a,P) 



Figure 9.9. Schematic Picture 




The first picture is an instance when a is orientation preserving and the second picture is an 
instance when a is orientation reversing. Let e R (a, (3) be the point on the edge [e a , ep] whose 
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convex combination satisfies \p/\ a = R. Let Fr(o;, (3) denote the convex hull of e#(a,/3) 
and the vertices of the simplex other than e a and e@. In the picture, it is represented by the 
plane that is shaded. Then A#(a,/3) is the wedge in the simplex bounded by Fr(«,/3) on 
the top and F(a) on the bottom i.e. it is the region in the simplex below the shaded plane. 
As a result, Y R (a,j3) is the part of W that lies below the intersection of the shaded plane 
and W, which is shown by the bold line segment in either picture. 

The position of e#(a,/3) on the edge joining e a and is determined by the value of K 
and the ratio The assumption |Q(a)| > (1/C) max-ygs |Q(t)| implies that 

the ratio is bounded below by 1/C. Using this lower bound, we see that the 

points in An(a,j3) and hence in Y#(a,/3) satisfy 

As K 

< 910 > i £ c 

This means that, for every /3, the distance along the edge of the point e#(a,/3) from is 
bounded above by a quantity that depends only on M and C. Moreover, this quantity goes 
to as M goes to infinity. In other words, each wedge is uniformly thin. 



Step 3. In this step, we triangulate W by (d — 2)-dimensional simplices A(i) in a specific 
way. Then to get the upper bound for measure relative to JQ(W), it is enough to get the 
same bound for measure of the intersection with each simplex in the triangulation. 

Recall from Section [5j each configuration space W is a convex hull of a finite number of 
vertices, either of type e a p or e 7 . Fix a triangulation of W by (d — 2)-dimensional simplices 
as follows: 

(1) Triangulate dW without introducing new vertices. This can be done in any manner 
by adding an appropriate number of new diagonals. 

(2) Fix the vertex p of W where if a is orientation preserving then p = e a , and if a is 
orientation reversing then p = e ai for some 7. Cone off the triangulation of dW to p 
to get a triangulation of W . 

For example, in the second figure of Figure 19.91 we have triangulated by adding in the 
diagonal of the quadrilateral. The number of simplices A(i) in a triangulation is bounded 
above by some number n(d) that depends only on d. 

Let the individual simplices in the triangulation of W be A(i). By construction, each A(i) 
contains the vertex p of W. To show the required bound for £(X R (a, /?)), it is enough to 
show that for M large enough, the ratio £( JQ(Y R (a, (3) R A(z)))/£( JQ(A(i))) is bounded 
above by l/2d 2 for each i. Taking union over all i implies the required bound for X R (a,/3). 
So for the remainder of this proof, we will focus on one such A(i). 

Step 4- The intersection Yn(a,/3) D A(i) is a half-space intersection H R A(i) with dH = 
Fr(«,/3) (see Section[8]). In Inequality (I9.10p . if K/C > 1/2, then the half-space intersection 
is opposite p. In this step, we shall show that H R A(i) opposite p is i?/2-thin. Hence we 
can use Lemma 18.101 to include the half-space in a union of wedges in A(i). This reduces 
the task to showing a suitable upper bound for the ratio in each of these wedges. 

The edges of A(i) with endpoint p that have non-empty intersection with -Fr(«, j3) are the 
ones joining p to a vertex q of A(i) in F a . Denote the set of vertices of A(i) in F a by Sh 
and for each vertex q G Sh, let a q denote the point of intersection F R (a,f3) R [p,q]. Since 
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a q G F R (a,/3), it's convex combination in the full simplex A satisfies 

(9.11) ^ = R 

X a 

Let a q have the convex combination a q = X p p + X q q in A(z). Write 
(9-12) p = , q = 

where 7 = a or 7 7^ a depending on whether a is orientation preserving or orientation 
reversing, and similarly r\ = [3 or r\ 7^ (3. This means that A Q = X p or X a = X p /2, and 
similarly Xp = X q or A/3 = X q /2. In any case, combining this with Equation (19.111) implies 
that convex combinations in A(i) of the points on the line segment joining a q to q satisfy 

Xg > R 

X p 2 

Thus our half-space intersection with A(i) opposite p is i?/2-thin. By Lemma 18.101 for 
Rq = R/2(d — 1), we have the inclusion 

(9.13) Y R (a, 0) n A(i) c |J AQ^fog) 

Using Equation (19. 12ft . the ratio |Qp|/|Qg| satisfies 

\Qp\ _ \Q(a)\ + \Q(j)\ <(c + 1) IO(«)l 



lOffl iw)| + iqmi v y ig(/?)i 

So if we set 



7 ' ' 1 AdC J \Qq\ 
then i? > Ri- This implies the inclusion 

A(i) Ro (p,q) C A(i) Rl (p,q) 

for each g G S#. Combining it with inclusion (I9.13p . we get 

Y R {a,P)nA(i) C |J A(i) Rl (p,q) 

The number of simplices A(i) is at most n(d). So to establish the required upper bound on 
£(jQ(Y R (a, (3) fl A(i))), it is enough to show that there is a choice of M and consequently 
of Ri such that 

(9-14) £(JQ(A(i) Rl (p,q))) < ^±—£(jQ(A(i))) 

Step 5. Equation ( 18.91) applies to projective linear maps with non-negative entries of the 
standard (d — l)-dimensional simplex into itself. So to apply Equation (18. 9p to get Esti- 
mate (I9.14p . we need to extract such a map JQ : A^-2 — > A^_2 from the restriction of JQ to 
A(i). This creates two issues: First, to extract the map JQ, we have to identify A(i) with 
A d ^ 2 ] this introduces a distortion in the standard measure on A d _ 2 - So we need to show that 
this distortion is universally bounded. Second, we need to show that the Jacobian J(JQ) is, 
up to a universal constant, the same as the full Jacobian J& d _ 2 (JQ) of the extracted map. 
In the final step, we show how to do this. 
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Recall from Section [5] that the volume forms on configuration spaces are normalized so 
that the volume of each configuration space is 1. Hence, up to the ratios of the configuration 
space volumes, for a measurable subset X of JQ(W) 

(9.15) £(X) « / J(JQ)(y)xY(y)dy 

Jw 

where \y is the characteristic function of the set Y = {JQ)^ X {X). Using f)9.15p . we get 

/ J(JQ)(y)dy 
£(JQ(A(i) Rl (p,g))) _ A(i) Rl (p, Q ) 

£(jQ(A(i))) " / J(JQ)(y)dy 

A(i) 

Recall from Section the expression for the Jacobian of the restriction to W. 

1 



J(JQ)(y) 

So 



a\Qy\ 



d-1 



(9.16) 



e(jQ(A(i) Rl (p, q ))) _A(i) fll (p, g ) lQyl 

£(jQ(A(i))) I jQ^rdy 

A(i) 



In Equation (I9.16p . the integrand looks similar to the Jacobian of a determinant 1 projective 
linear map, with non-negative entries, of a standard (d — 2)-dimensional simplex A^_ 2 . We 
indicate how to make this observation precise and then use it to bound the ratio from above. 

Let A be the subspace in M. d spanned by Wo and let A be the subspace in M. d spanned 
by W. Thus A and A are subspaces of R d satisfying the equations that define W and 
W respectively. Fix some linear isomorphism F : IR d_1 — > Aq such that F(Ad-2) contains 
Wq. Also, up to a permutation of the standard basis in there is a unique linear 

isomorphism G(i) : R d_1 — > A such that G(i)(A^_ 2 ) = A(i). Use the identification G(i) to 
label the vertices of by the corresponding labels of A(z). 

Using the map F and the identifications G(i), we define a linear map Q = F~ x o Q o G(i). 
Since the projectivization JQ maps A^-2 into itself, the map Q has non-negative entries. 

Since there are finitely many configuration spaces and finitely many simplices in the trian- 
gulation of each configuration space, the distortion by G(i) of the standard Lebesgue measure 
on A^_2 is bounded above by a constant that depends only on d. This means that for a point 
y G A^_2, the quantity l/|<5G(i)y| d_1 (the integrand on the right in (19.161) ) is the same as 
the full Jacobian J& d _ 2 (JQ)(y) = l/|Qy| <l!_1 , up to a constant that depends only on d. This 
also means that up to a universal constant, the norm of the r-th column |Q(r)| is the same 
as |Q r l- 

The restriction A(i) Rl (p, q) JQ(A(i) Rl (p,q)) corresponds to J(QE) : A d _ 2 — > A d _ 2 
where E is the elementary matrix whose off-diagonal entry in the (q,p) place is Ri. 
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Applying Corollary 18.41 we get 

£(jQ(A(i) Rl (p,g))) 
l(JQ(A(i))) 



\Q(P)\ 



\Q(p)\+Rt\Q(q)\ 



\Q(p)\ 



AdC 



\Q(p)\ + 



K \ \Q P \\Q(q)\ 



AdC 



\Qq\ 



< 



4dC 



AdC + K ' K 

Finally, if M is large enough such that K > 8d 3 n(d)C, then we get Estimate (19.141) ; moreover 
it is easily checked that as M — ► oo, K —* oo and so the second part of the Proposition 19.51 
also follows. □ 



In the next lemma, we use the technique in Step 5 of Proposition 19.51 to estimate from 
below, the probability of certain splits. This shall be useful later in Proposition 19.211 

Lemma 9.17. Let 7 and a be the bands in critical positions at some stage 7Ti. Let C > 1 be 
a constant such that 

\Q(j)\ > -max|Q(a)| 

Then up to a universal constant depending only on d, the probability that 7 splits a is bounded 
below by the quantity l/n(d)(2C) d . 

Proof. Let 7r 2 be the stage resulting from 7 splitting a. Let W\ and W2 denote the configu- 
ration spaces corresponding to ~K\ and 7r 2 . 




Figure 9.18. 7 splits a 



A schematic picture is shown in Figure 19.181 with the shaded part X in W\ representing 
the split. In other words, the shaded part X is the image of W2 under the projectivization of 
the elementary matrix associated to the split. Similarly, let X' be W\ \ X; this corresponds 
to o splitting 7 instead. Except when 7 is the only orientation reversing band on its side, 
X' is non-empty. When X' is empty, the split occurs with probability 1; so we focus on the 
split in which X' is non-empty. Let Z denote dX D dX' i.e. the part of the boundary that 
separates the two sets. 

The vertices of W\ fall into the following categories: 
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(1) If a vertex has the linear combination (l/2)e CT + (l/2)e T with r ^ 7, then it lies in 
X\Z. 

(2) If a vertex has the linear combination (l/2)e 7 + (l/2)e T with r/(r, then it lies in 
X'\Z. 

(3) All other vertices of W\ lie in Z. 



Just as in Proposition 19.51 let A\%) be the triangulation of X'. Denote the set of vertices 
of A'(i) by V'{i). Let io be the index for which the measure £(JQi(A'(zo))) is the maximum 
among all i. The triangulation has at most n(d) simplices. Hence we get the estimate 

e(JQi(X')) <n{d)l(jQ x {A\i Q ))) 

Let Vz be the subset of vertices of A'(i ) that lie in Z. Construct any triangulation of X 
such that there is a simplex A(j) for which the points in Vz are vertices. Then 

t(JQ^X)) ( \ \ e(JQi(A(j))) 



£{JQx(X')) \n(d)J ^JQ^A'ito))) 

So it is enough to show a lower bound for the right side in the equation above. Let V(j) 
denote the set of vertices of A(j). Associated to the simplices A'(i ) and A(j), we define 
maps Qi(io) = F^ 1 o Q 1 o G'(io) and Qi(j) = F^ 1 o Q 1 o G(j) from the standard (d — 2)- 
dimensional simplex into itself. See Step 5 of Proposition 19.51 for the definition of these maps. 
The gist of the discussion in that step is that up to a universal constant introduced in the 
identifications, 



(919) 'Wi(Atf))) ^ H jgi(j)(A 



^Wi(A'(io))) i (jg(, )(A d _ 2 



oj 



There is a unique linear isomorphism G(ji ) of R 1 such that Qi(«o) — QiO) ^(j^ 
Since the number of identification maps G(*) is finite, the number of linear isomorphisms 

that relate them is also finite. Hence the ratio of the determinants of Qx(*) is universally 
bounded. This bound can be incorporated in applying Lemma [8721 to evaluate the right hand 
side in (19.191) . Recall that the norm of the r-th column Qi(*)(r) is up to a universal constant 
the same as the norm of Qr. In conclusion, 

i(ja(i)(A, S\ n \QM II \QM 
[ ' ) e(jQ^ )(A d _ 2 )) ~ n n lOigl 

V YU A V ?ev(j) 9eV(?)\v z 

In (I9.20p . the number of vertices in the numerator is exactly equal to the number of vertices 
in the denominator. So the terms in the products in the numerator and the denominator 
pair off. In addition, any vertex p in V'{iq)\Vz is a vertex in X'\Z i.e. in our categorization, 
the vertex belongs to category (2). Hence it's linear combination is p = (l/2)e 7 + (l/2)e r , 
and so it satisfies \Qip\ > |Qi(7)|/2. Hence for a vertex q in V(j) \ Vz, we have the estimate 

|Q lP | ^ |gi(7)l/2 . 1 
\Qxq\ max aeS \Qi(a)\ 2C 
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Using the estimate in (19.201) gives the lower bound 



£(jg 1 (j)(A d _ 2 )) / 1 y-i 





t(JQi{Wi)) l + n(ci)(2C) d - 1 n(d)(2C) 



□ 



The next proposition, which goes back to Kerckhoff [5] , has the following idea: One starts 
off by arranging the columns Q(a) in the order of decreasing norm and then picks out a 
Co-distributed subset Ao starting with the column with the largest norm and following the 
order. Then the proposition shows that there is a definite probability, independent of the 
stage, that some future stage has a larger subset A 2 Ao that is Ci-distributed and contains 
the column with the largest norm, with the constant C\ independent of the stage. Thus 
the proposition can be used as an iterative step which we iterate over a finite number of 
times to get, with a definite probability, a future stage that is C-distributed. The precise 
statement of the proposition here is identical to 4.4 Lemma 5 in Yoccoz [IT] , except that it 
is formulated for non-classical interval exchanges. 

Proposition 9.21. Let Co > 1 be a constant and let Ao C S be a set of Co-distributed 
columns which contains the largest column in norm. Then there exists constants c\ 6 (0, 1) 
and C\ > 1, depending only on Co and d, and a finite set of future stages Tie, after n, that 
satisfy the following two conditions: 

(1) Denote the matrix corresponding to each ttq by Qq and the configuration space by Wq. 
The sets JQoiWo) have disjoint interiors and 
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(2) For every 9, there exists a set of C\- distributed columns Ao 2 -4o that contains the 
largest column at the stage ttq. 

Proof. We recall the basic idea of the proof from Section [6j Before the norms of columns in 
Ao increase by some factor, suppose one of the following happens: either a column outside 
Ao becomes the column with the largest norm or a column in Ao moves a column outside. 
At this point, if we let A be Ao union this outside column, then as we shall see, the new 
collection is Ci-distributed where C% depends only on the initial constants. Then it remains 
to show that with a definite probability, one of the two events happens. 

The proof here follows the proof of 4.4 Lemma 5 in Yoccoz [11] closely, except that the 
individual steps require justification for non-classical interval exchanges. The steps are as 
follows: 

Step 1. To make it possible for a column in Ao to move a column outside, a column outside 
Ao needs to land in one of the critical positions. In this step, we show that there is a definite 
probability that this happens before the maximum of the norms of the columns increases by 



^2e{jQ {W0))> Cl e{jQ(w)) 
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a definite factor. Precisely, we claim that it can be split to a future stage ir L such that, for 
constants c[ G (0, 1) and C[ > 1 depending only on Co and d, we have 

(9.22) max|Q t (a)| < C(max|Q(a)| 

(9.23) t{JQ,{W.)) > cWQ{W)) 

and at least one of the bands in the critical positions in 7r t does not belong to Ao- 

Proof. Consider the sink Go of the reduced diagram G that contains 7r . The nodes in G 
correspond to unlabeled combinatorial types. By Proposition 19.11 for each node 7r in Gq and 
for each band in ir, there are splitting sequences which bring that band to a critical position. 
For each band in tt, choose one such splitting sequence. The set of chosen splitting sequences 
over all nodes of Go is equal to d times the number of nodes in Go- This means that there 
is an upper bound h, depending only on d, on the lengths of all chosen splitting sequences. 

Now we introduce the labeling. For some band (3 in S\.4.o> let be the chosen splitting 
sequence starting from n that brings f3 into one of the critical positions. Let t be the shortest 
prefix of u{j3) that has a band, which is not in Ao, in a critical position. 

By construction, the length of t is at most h. This implies that the norm of a column in 
Ao participating in i can increase by a factor of at most 2 h . So we get the estimate 

max \Q,(a)\ < 2 /l max \Q(a)\ 

Thus (I9T22D holds with C[ = 2 h . 

To show the bound f!9.23j) for the measures, it is enough to show that every split in i has 
relative probability bounded below by a constant that depends only on Go and d. 

Suppose we are at some intermediate stage 7Ti of i. Following tti, suppose a band 7 in Ao 
splits a band a in Ao to give us the next stage 7r 2 . The column Q\{pf) satisfies the estimate 

\QM\ > |Q(7)I > 7^max|Q(a)| > -^ 7 max|Q 1 (a)| 

By Lemma [9.171 the probability of this split is bounded below by l/n(<i)(2CoC{) a! , up to a 
universal constant that depends only on d. Finally, using the fact that the length of 1 is 
bounded above by h, we get 



So setting 



Ci 



^ n(d) h (2C C[) dh 

proves Step 1. □ 

Step 2. Notice that for every a G Ao, we have |Q t (a)| > (1/G^)max 7€ 5 |Q t (7)|. So we can 
apply Proposition 19.51 to the stage 7r t . Let 

X M , a = {x G JQ L {W L ) : \Q nM (a)\ > M\Q t (a)\} 
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By Proposition I9.5[ there exists M > 1, depending on C[ and d such that £(Xm,o) < 
{l/2d)£{JQ L {W L )). If we let 

X = JQ t (W t ) \ |J X M , a 

then we have the estimate 

(9 24) ({X) :~ 1 V > ! V 1 ; 1 



Form this point on, the rest of the proof follows YoccozJTT] almost verbatim. We include it 
here for completeness. 

Step 3. Since at 7r t , a band outside Aq is in a critical position, it is not possible to get a stage 
in which both bands in the critical positions are in Aq unless a band in Aq moves a band 
outside. This observation can be exploited to show that with a definite probability, either a 
band outside Aq becomes large enough in norm or a band in Aq moves a band outside. 

For each x £ X, let 7r X;m denote the stage after ir L such that vr x>m _i — > 7r X)jn is the first 
instance when a band in Aq is split. To each x £ X, we associate a stage 7r(x) in the path 
7r t — > • • • — > 7r x m as follows: 

Case (1). If at some intermediate stage ir' 

(9.25) |Q'(/?)| >max|Q t (a)| 

for some /? £ S\Aq, then we let 7r(x) be the first instance when Inequality (19.251) is true for 
some band in S \ Aq. This also means that in 7r(x), there is a unique band (3 £ S \ Aq that 
satisfies Inequality (I9.25p . 

Case (2). If no band in S \ Aq satisfies Inequality (19.251) at any intermediate stage, we set 
7r(x) = 7r X)m . Here a band (3 £ S \ Aq splits a band a £ Aq in the final split before 7r(x). So 

(9.26) |Q tf (/3)|HQ«-i(/3)l + lQ«-i(a)l 

Step 4- Given the estimate (I9.24p . it is possible to select a finite number of stages 7Cg from 
the collection of stages 7r(x) constructed in Step 3 such that JQeiWe) have disjoint interiors 
and 

£(U e JQe(W e )) > ±£(X) > ^£(JQ L (W L )) 
For each TTg as above, let A be the set of 7 £ S that satisfy 

(9.27) \Q e ( 7 )\>Co 1 max\Q(a)\ 

Claim 9.28. For each 6, the collection A is strictly larger than Aq and is CqC[(1 + M)- 
distributed. 

Proof. The stage no belongs to one of the two cases in Step 3. In both cases, the distinguished 
band (3 is in A \ Aq. 

Given the lower bound (I9.27P in the definition of A, it is enough to show that the column 
largest in norm has an appropriate upper bound. Because we are dealing with subsets of X, 
every band a £ Aq satisfies 

(9.29) \Q e (a)\ < M\Q L (a)\ 
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This implies 

(9.30) max \Q g (a)\ < M max |Q t (a)| 

If the largest column at ttq is in Ao, then estimates 09.221) . (I9.27P and (I9.30P imply that A is 
CoC^M-distributed. So it is also CqC[(1 + M)-distributed and we are done. Hence to finish 
the proof, we shall assume that the largest column is not in Aq. A stage ttq can belong to 
Case (1) or Case (2) of Step 3. We argue the two cases separately. 

Case 1. If He belongs to Case (1), then there is a unique band (3 G S \ Ao that satisfies 
Inequality (I9.25[) . By assumption, the largest column is not in Ao; hence the column Qe(P) 
has to be the largest. At every stage before lie, no column in S\Aq satisfies Inequality (19.251) 
and no column in Ao moves a column in S \ Ao- So we get the bound 

\Qe(P)\ < 2max|Q t (a)| < (1 + M) max \Q t {a)\ 

a€Ao a£Ao 

Combining with Inequality (I9.22p results in the upper bound 

\Q 9 (0)\ < (1 + M) max \Q L (a)\ < C[{1 + M) max \Q{a)\ 

aGAo aeAo 

and so the collection A is CqC[(1 + M)-distributed. 

Case 2. If ttq belongs to Case (2), then in the final split before 7r e , some band a in Ao is 
split by some band (5 in S \ Ao- So it satisfies (I9.26p . Along the sequence n b —►•••—>• tc 9 , no 
band in S \ Ao satisfies Inequality (I9.25P at any intermediate stage. This implies that the 
column Qe{/3) has to be the largest in norm. It also implies that 

(9.31) \Q e -i(f3)\<m a x\Q L (a)\ 
Inequalities (19.221) and (19.311) give the upper bound 

(9.32) \Qo{P)\ < (1 + M) max \Q b (a)\ < C[(l + M) max \Q(a)\ 

a£Ao aeAo 

and so the collection A is CoC[(l + M)-distributed. □ 

Setting ci = (1/4) and C\ = CoC[(l + M) concludes the proof of the Proposition 19.211 

□ 

Iterating Proposition 19.211 we get 

Proposition 9.33. There exist constants c G (0, 1) and C > 1 that depend only on d, and 
a finite number of future stages tt t after tc, such that 

(1) The sets JQ T (W T ) have disjoint interiors and 

^£{JQ T {W T )) >c-i{JQ{W)) 

T 

(2) Each stage tt t is C -distributed 

To prove Theorem 15.101 apply Proposition 19.331 to it to get a set of stages, which we now 
denote by tt^, that are C-distributed. Let be the union of the sets JQ T (W T ). Consider 
the complement JQ(W) \ X' 1 '. Write it as a union of stages with disjoint interiors. Apply 
Proposition 19. 331 to each of them to get a new set of stages, which we denote by ti^\ that are 
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C-distributed. Let X^> denote the union of the sets that correspond to the new C-distributed 
stages. Now consider the complement JQ(W) \ U and iterate the process. 

As a result, we construct an infinite sequence of sets X^ with pairwise intersections of 
measure zero, such that they are a union of C-distributed stages. The set of x e JQ(W) 
whose expansion gets C-distributed at some stage after tt, is equal to the infinite union of 
the sets iW. Let 

n-l 

yW = JQ(W) \ |J X® 

i=i 

By Proposition 19.331 we know that t{Y^ n >) < (1 — djKY^ 1 ^). Hence by induction 

lim £(F (n) ) = 

So the infinite union of the sets X<ri has full measure which concludes the proof of Theo- 
rem 15.101 An immediate consequence of Theorem 15.101 is 

Theorem 9.34. For almost every x £ Wo, the expansion o/x becomes C-distributed infin- 
itely often. 

Proof. Theorem 15.101 says that the set whose expansion never gets C-distributed is measure 
zero. The set of x whose expansion gets C-distributed finitely many times is a countable 
union of such measure zero sets. Hence it is measure zero, proving the theorem. □ 

10. Normality 

In this section, we prove normality, i.e. Theorem 11.21 We now clarify the phrase "non 
transient sequences that can occur" in the statement of Theorem 11.21 Recall that to avoid 
transient vertices, we assumed that the initial stage ttq is in some sink Go of the reduced 
diagram G. So a non-transient sequence that can occur is any finite sequence in Go- 

For every tt' in Go, choose a shortest splitting sequence from tt' to ttq. This gives a finite 
number of chosen splitting sequences. This means that for any C-distributed ir' 

(1) With probability bounded below by a constant d > that depends only on d, it is 
possible to split from the C-distributed tt' to a stage that is combinatorially 7r using 
the chosen splitting sequence. 

(2) Splitting from it' to a stage as above, introduces a bounded amount of distortion in 
the measure, where the bound depends only on d. 

The statements above imply that in Proposition 19.331 f° r different constants c G (0, 1) 
and C > 1 which depend only on d, one gets C-distributed with a relative probability 
greater than c, such that the C-distributed stages are combinatorially tt . In other words, 
the uniformly distorted stages in Theorem 15.101 can be assumed to be combinatorially tt . 
Moreover, because of this additional conclusion in Theorem 15.101 and since Go is strongly 
connected in the directed sense, it is enough to prove normality for finite sequences that 
start from ttq. 

Let j = 7To — > ■ ■ ■ — > tt be a finite sequence of splits. We prove the stronger form of 
normality stated below, which is necessary to prove unique ergodicity. 

Theorem 10.1 (Strong Normality). In almost every expansion, for any finite sequence j 
starting from ttq, there are infinitely many instances in which j immediately follows a C- 
distributed stage. 
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Proof. By Theorem 19.341 for every k e N, there is a k-th instance of C-distribution in the 
expansion of almost every x e Wo. Let vr Xjnfc be the C-distributed stage that is the k-th 
instance. For a finite non-transient sequence j, let Xk denote the set of all x for which the 
expansion after vr x nfe begins with j. Recall from Section [3, that a C-distributed stage is 
C" i_1 -uniformly distorted. So the probability that j follows any C-distributed stage is, up 
to a constant that depends only on C and d, the same as the probability that an expansion 
begins with j. Let this probability be ji > 0. So we have £(Xk) ~ /i. Since 

oo 

^£(X fc ) = oo 

k=l 

by the standard Borel-Cantelli Lemma [9], the set of x that belong to infinitely many X], 
has full measure. □ 



11. Unique Ergodicity 

By the first proposition in Section 4.4 of Yoccoz [TT], for every x, the infinite intersection 
of the nested sequence of sets given by the expansion of x can be identified with the set of 
transverse measures. Consequently, to show unique ergodicity it is enough to show 

Theorem 11.1. For almost every x in Wo, the nested sequence of sets JQ n (W n ) given by 
the expansion o/x satisfy 

f]jQ n (W n )=x 

In other words, almost surely the infinite intersection nests down to a point. 

Proof. It is enough to prove that almost surely, the sets JQ n (A) nest down to a point. To 
do this, suppose there exists a finite sequence j : n nj with the diameter shrinking 

property: There exists a universal positive constant R < 1 such that when any C-distributed 
stage 7r is followed by j, the diameter becomes less than the diameter at 7r by a factor less than 
R. By strong normality, in almost every expansion, the sequence j follows a C-distributed 
stage infinitely often. This implies that for almost every x, the diameter shrinking occurs 
infinitely often and so the infinite intersection of the sets JQ n (A) has diameter zero, i.e. it 
is a point. 

It is not immediate from the definition of C-distribution that a sequence j that shrinks 
diameter by R < 1 at Wo also shrinks diameter at every C-distributed stage by R (up to 
a constant that depends only on C and d), i.e. that j has the diameter shrinking property. 
The subtle issue is that a C-distributed stage is defined using the Jacobian. So it is not 
immediate that distortion is uniform along line segments in A. The following lemma shows 
that for a C-distributed stage, the distortion along line segments in A is C 2 -uniform. 

Let 7r be a C-distributed stage and let Q be the associated matrix. Let L be a line segment 
in A with endpoints ui and u 2 in OA. Denote the unit tangent vector along L from u x to 
u 2 by v. Let wi = Qui and w 2 = Qu 2 . As in Section [TJ denote the full derivative of JQ as 
a map from A to itself by D JQ. 

Lemma 11.2. For any pair of points yi and y 2 in L, 

-L|DJQ( yi )(<;)| < |DJQ(y 8 )(t;)| < C 2 \DJQ( yi )(v)\ 
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Proof. Parameterize Q(L) by f(t) = (1— t)wi+tw 2 . Then g(t) = f{t)/\f(t)\ is a parameteriza- 
tion of JQ(L). By additivity of the norm, \f(t)\ = (1 — £)|wi| + £|w 2 |. Computing the 
derivative with respect to t, we get = |w2| — |wi| This gives 



g'(t) 



\f(t)\f'(t)-f(t)\f(t)\> 
\f(t)\ 2 

((1 - t)\w 2 \ + t|wi|) (w 2 - Wi) - ((1 - t)wi + tw 2 ) (|w 2 | - |wi|) 

\IW 

|wi|w2 — |W2|W1 



\f(t)\ 2 

This means that for any pair of values t\ and t 2 in the interval [0, 1], 

Let h(t) = (1 — t)ui + tu 2 parameterize L. The map JQ restricted to L is given by JQ(y) = 
g o h~ 1 (y). Since h is a parametrization of L with constant speed, the estimate (111.31) 
concludes the proof. □ 

By Corollary 4 from Section 4.3 of Yoccoz [IT], there exists a splitting sequence j from 
7r whose associated matrix has every entry positive. For non-classical interval exchanges, 
the corollary is a consequence of Proposition 19.11 using an argument identical to the original 
proof in Yoccoz. The corollary implies that the image of A under the associated map is in 
the interior of A. We fix this sequence j for the rest of this section. 

Let t a be the vertices JQj(a) of JQ 3 (A). Let s denote the minimum distance between t a 
and dA. The number s is related to the largest entry of Q r 

Extend the edge [t a ,tp] of JQj(a) in either direction to a line segment L[a,0\ with corre- 
sponding endpoints u a and up in <9A. Then the distances d(t a ,u a ) and d{tp,up) are greater 
than s. So the length of [t a , tp] is at most (1 — 2s) times the length of L[a,0\. 

Proposition 11.4. Suppose it is a C-distributed stage and let tt' be the stage obtained by j 
following tt. Then there exists a positive constant R < 1 that depends on s,C and d such 
that 

diam(JQ'(A)) < R diam(JQ(A)) 

Proof. Since JQ'(A) is a simplex, the longest line segment in it is a side joining a pair of 
vertices. Since Q' = Q ■ Qj, we assume that this side is JQ([t a ,tp]). Let v be the unit 
tangent vector along the segment L[a, 0\. By Lemma [11.2[ the distortion of v is C 2 -uniform 
on L[cx, (3\. Combining this with the fact that the length of [t a ,t/j] is at most (1 — 2s) times 
the length of L[a,j3], we get the distance estimate 

d(JQ(t a ),JQ{tp)) C 2 (l-2s) 



d(JQ(u a ),JQ(up)) C 2 (l-2s) + 2s 

Taking R to be the right hand side in the above inequality finishes the proof of Proposi- 
tion QXI □ 

Strong normality applied to the diameter shrinking sequence j implies that diameter 
shrinking happens infinitely often in almost every expansion, and so almost surely, the infinite 
intersection is a point. □ 
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Finally, Theorem 111.1 1 implies that for almost every x, there is a unique transverse probabil- 
ity measure. Consequently, almost every non-classical interval exchange is uniquely ergodic. 

12. Rauzy map on the parameter space 

Consider the full parameter space over a sink for a non-classical interval exchange. It is 
a disjoint union of configuration spaces, each associated to different combinatorial types in 
the sink. Rauzy induction induces a map from this parameter space to itself described in 
Section [5j Each configuration space in the full parameter space carries a natural Lebesgue 
measure. A consequence of Theorems 19 . 341 and 1 1 1 . 1 1 is 

Theorem 12.1. The Lebesgue measure on the full parameter space is ergodic with respect 
to the Rauzy map. 

Proof. Let A be a Borel set in the full parameter space invariant under the Rauzy map and 
suppose that A is not full measure. Then A is not full measure in some configuration space 
W corresponding to the combinatorial type it. Let x be a point of Lebesgue density for the 
complement W \ A. Given 1/2 > e > 0, there is a ball -B(x) centered at x such that the 
proportion of A in this ball is less than e. By Theorems 19 . 341 and 111.11 the ball -B(x) can be 
closely approximated by a union of C-distributed stages contained in £>(x). Let us suppose 
the approximation is chosen so that the union of these C-distributed stages has measure 
greater than (1 — e)£(£>(x)). If the proportion of A in each of these C-distributed stages 
were greater than 2e, then that would imply that the proportion of A in B(x) is greater than 
2e(l — e) > e, a contradiction. This means that the proportion of A in at least one of the 
C-distributed stages is at most 2e. Call this stage T. 

Let n' be any combinatorial type in the sink, and let W be the associated configuration 
space. Due to the strong connectivity of the sink, it may be assumed that T is actually 
obtained by a splitting sequence j starting at ir and terminating in it'. 

By the invariance of A, the proportion of A in W is at most 2e. Since e can be made 
arbitrarily small and it' is any combinatorial type in the sink, the conclusion is that A must 
have measure zero. Thus the only invariant sets for the Rauzy induction on the parameter 
space associated to a sink have zero or full measure i.e. the Rauzy map is ergodic. □ 

13. Linear involutions 

Non-classical interval exchanges are linear involutions [TJ that correspond to the principle 
stratum of quadratic differentials. In [TJ, Boissy and Lanneau establish criteria for irreducibil- 
ity of linear involutions. These criteria indicate when a particular linear involution either 
sits in a sink or always splits to end up in some sink. They give two notions for irreducibility 
of linear involutions: geometric and dynamical. The geometric notion of irreducibility is the 
strongest and implies dynamical irreducibility. It also implies that the Veech construction 
for suspending the linear involution holds and that the orbits for almost all starting widths 
are minimal. Moreover, a geometrically irreducible linear involution always belongs to some 
sink for the associated Rauzy diagram. The sinks are called Rauzy classes in literature and 
they classify the connected components of the associated stratum of quadratic differentials. 

On the other hand, dynamical irreducibility is a weaker notion and does not define a Rauzy 
class unless the width data comes from a suitably restricted set called the admissible set. In 
many situations, the admissible set has full measure. But there are instances when this is not 
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so. It turns out that a linear involution that is dynamically irreducible and has admissible 
widths eventually splits down to a linear involution that is geometrically irreducible. More 
importantly for us, a dynamical irreducible linear involution that has admissible widths is 
also measure irreducible in our sense. In particular, this means that if we restrict to some 
sink, any geometrically irreducible involution that sits in it, is measure irreducible in our 
sense. 

To prove the theorems for non-classical interval exchanges, the logical correlations we used 
were as follows: Completeness of the underlying train tracks implies measure irreducibility, 
measure irreducibility implies Proposition 19. 11 and finally Proposition 19. II combined with the 
analysis of the restriction of the Jacobian gives the proofs of the main theorems. 

So to generalize the results to linear involutions, it is enough to restrict to some sink 
in the associated Rauzy diagram. Then the linear involutions considered are geometrically 
irreducible and hence measure irreducible. Consequently, all the theorems are true for geo- 
metrically irreducible linear involutions. 
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